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What is claimed is: 

1 . A method of clustering genes, comprising: 

for each condition k of n conditions, determining a probability p ik of a first gene 
5 gi being in its induced state and a probability p jk of a second gene gj being in its induced 
state; 

deriving a contingency table for the first gene gi and the second gene gj based on 
the probabilities p ik and p jk ; 

deriving a mutual information M for the first gene gj and the second gene gj based 
10 on the contingency table; and 

clustering the first gene gi and the second gene gj based on the mutual information 
Mas a metric. 

2. The method of claim 1, wherein determining the probability p ik includes 
15 calculating the probability p ik based on a probability function. 

3. The method of claim 1, wherein determining the probability p ik includes 
calculating the probability p ik based on: 

Pik =S^( e ik 

20 wherein (x) = l/(l + e'^ ) , e ik represents an expression level of the first gene gi under 
condition k 9 and and d { represent parameters associated with the first gene gj. 

4. The method of claim 1, wherein determining the probability p ik includes 
calculating the probability p ik based on: 

25 P lk =r»/(r» + c»e e <>"), 

wherein r ik represents an expression level of the first gene gi under condition k y c ik 
represents a control expression level of the first gene gi under condition k, and /u i and 0. 
represent parameters associated with the first gene gj. 
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5. The method of claim 4, wherein //, = 1, and 0. = 0. 

6. The method of claim 1 , wherein deriving the contingency table includes 
5 calculating a 2x2 contingency table T ijtXy based on: 

( n n \ 



T.. = 



YjPikijh YjPikPjk 



\k=\ k=\ J 

wherein q ik = 1 - p ik , 9 yjt = 1 - , * ranges from 0 to 1 , and y ranges from 0 to 1 . 

7. The method of claim 6, wherein deriving the mutual information M includes 
1 0 calculating the mutual information M based on: 

jc=0,l>;=0,I X r jy 

wherein - T ijfXy /n,P Lx = Yjv* ln ' andP y> = /w • 

8. A method of clustering genes, comprising: 

1 5 for each condition A; of « conditions, determining a probability p ik of a first gene 

gi being in its induced state based on a first probability function and a probability p jk of a 

second gene gj being in its induced state based on a second probability function; 

deriving a contingency table for the first gene gi and the second gene gj based on 
the probabilities p ik and p Jk ; 

20 deriving a mutual information M for the first gene gi and the second gene gj based 

on the contingency table; and 

clustering the first gene gi and the second gene gj based on the mutual information 
Mas a metric. 

25 9. The method of claim 8, wherein at least one of the first probability function and 
the second probability function corresponds to a sigmoidal probability function. 
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10. The method of claim 8, wherein deriving the contingency table includes 
calculating a 2x2 contingency table T ijtXy based on: 

n \ 

n ' 

YjPikPjk 
*-l J 

1 - p jk , x ranges from 0 to 1 , and y ranges from 0 to 1 . 

11. A method of deriving a mutual information M for a first gene gj and a second gene 
gj, comprising: 

for each condition k of n conditions, determining a probability p ik of the first gene 
10 gi being in its induced state and a probability p jk of the second gene gj being in its 
induced state; 

deriving a 2x2 contingency table T ijfXy for the first gene gi and the second gene gj 
based on the probabilities p ik and p jk > wherein x ranges from 0 to 1, and>> ranges from 0 
to 1; and 

1 5 deriving the mutual information M for the first gene gi and the second gene gj 

based on the 2x2 contingency table Tij tXy . 

12. The method of claim 1 1 , wherein determining the probability p ik includes 
calculating the probability p ik based on: 

20 

wherein g (x) = l/(l + e'** ) , e ik represents an expression level of the first gene gi under 
condition k y and fi t and 0. represent parameters associated with the first gene gj. 

13. The method of claim 11, wherein determining the probability p ik includes 
25 calculating the probability p ik based on: 



T 



f n 
n 

V*=t 



5 wherein q ik q jk = 
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wherein r# represents an expression level of the first gene gi under condition k, Cik 
represents a control expression level of the first gene g, under condition k, and ju i and 6 i 
represent parameters associated with the first gene gj. 

14. The method of claim 11, wherein deriving the 2x2 contingency table T ijtXy includes 
calculating the 2x2 contingency table T iJtXy based on: 



T 



f n n \ 

k=\ t=l 
n n 



wherein q ik = 1 - p ik , and q jk = 1 - p jk . 

10 15. A method of generating a list of genes, comprising: 

providing a set of gene expression data associated with a plurality of genes under 
n conditions; 

selecting a first subset of gene expression data from the set of gene expression 
data, the first subset of gene expression data being associated with a first gene gi; 
15 selecting a second subset of gene expression data from the set of gene expression 

data, the second subset of gene expression data being associated with a second gene gj; 

for each condition k of the n conditions, determining a probability p ik of the first 
gene gi being in its induced state based on the first subset of gene expression data; 

for each condition k of the n conditions, determining a probability p Jk of the 

20 second gene gj being in its induced state based on the second subset of gene expression 
data; 

deriving a mutual information M for the first gene gi and the second gene gj based 
on the probabilities p ik and p Jk ; and 

based on the mutual information M, generating the list of genes indicating the first 
25 gene gj and the second gene gj. 

16. The method of claim 15, wherein determining the probability p ik includes 
calculating the probability p ik based on: 
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Pik =g»S e ik - 9 i)> 

wherein (x) = l/(l + e~ MiX ) , e,* represents an expression level of the first gene gi under 
condition k 9 and ju t and 0. represent parameters associated with the first gene gj. 

5 17. The method of claim 15, wherein determining the probability p ik includes 
calculating the probability p ih based on: 

p. = 

wherein represents an expression level of the first gene gi under condition k, c,* 
represents a control expression level of the first gene gi under condition k, and fi i and 0 t 
1 0 represent parameters associated with the first gene gi. 

1 8. The method of claim 1 5, further comprising: 

calculating a 2x2 contingency table Ty tXy based on: 

f n n \ 



T. = 



k=\ 



1 5 wherein q ik = 1 - p ik , q jk = 1 - p jk , jc ranges from 0 to 1 , and y ranges from 0 to 1 . 

1 9. The method of claim 1 8, wherein deriving the mutual information M includes 
calculating the mutual information M based on: 



,v=0.1y=0,l ' Pu X Pjy 



20 wherein P ijiXy = T ilxy /n,P u = In , and Pj, = /n • 

_y=0,l jc=0,1 



20. A computer-readable medium, comprising: 
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code to determine, for each condition k of n conditions, a probability p ik of a first 
gene gi being in its induced state and a probability p jk of a second gene gj being in its 
induced state; 

code to derive a contingency table for the first gene gi and the second gene gj 
5 based on the probabilities p ik and p jk ; 

code to derive a mutual information M for the first gene gi and the second gene gj 
based on the contingency table; and 

code to cluster the first gene gj and the second gene gj based on the mutual 
information M as a metric. 

10 

21 . The computer-readable medium of claim 20, wherein the code to determine the 
probability p ik includes code to calculate the probability p ik based on a probability 
function. 

1 5 22. The computer-readable medium of claim 20, wherein the code determine the 
probability p ik includes code to calculate the probability p ik based on: 

Pik =gvS e ik- 0 i)> 

wherein (x) = l/(l + e' MiX ) , e& represents an expression level of the first gene gi under 
condition £, and //. and 0. represent parameters associated with the first gene g. 

20 

23. The computer-readable medium of claim 20, wherein the code to determine the 
probability p ik includes code to calculate the probability p ik based on: 

wherein r ik represents an expression level of the first gene gi under condition £, c,* 
25 represents a control expression level of the first gene gi under condition £, and ju i and 0. 
represent parameters associated with the first gene gj. 

24. The computer-readable medium of claim 23, wherein /J i = 1, and 0 i = 0. 
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25. The computer-readable medium of claim 20, wherein the code to derive the 
contingency table includes code to calculate a 2x2 contingency table T^ xy based on: 
f n n \ 

n n 

wherein q ik = 1 - p ik , q jk = 1 - p jk , x ranges from 0 to 1 , and y ranges from 0 to 1 . 



T. = 



10 



26. The computer-readable medium of claim 20, further comprising: 

code to receive hybridization data of a sample nucleic acid sequence and nucleic 
acid probes, wherein the code to determine the probability p ik includes code to calculate 

the probability p ik based on the hybridization data. 

27. The computer-readable medium of claim 20, further comprising: 
code to interface with a hypertext transfer protocol server. 



28. The computer-readable medium of claim 20, further comprising: 
1 5 code to generate a hypertext markup language document indicating the first gene 

gi and the second gene gj. 
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